Probability Inequalities for Kernel Embeddings in Sampling without Replacement

نویسنده

Markus Schneider

چکیده

The kernel embedding of distributions is a popular machine learning technique to manipulate probability distributions and is an integral part of numerous applications. Its empirical counterpart is an estimate from a finite set of samples from the distribution under consideration. However, for large-scale learning problems the empirical kernel embedding becomes infeasible to compute and approximate, constant time solutions are necessary. One can use a random subset of smaller size as a proxy for the exhaustive set of samples to calculate the empirical kernel embedding which is known as sampling without replacement. In this work we generalize the results of Serfling (1974) to quantify the difference between the full empirical kernel embedding and the one estimated from random subsets. Furthermore, we derive probability inequalities for Banach space valued martingales in the setting of sampling without replacement.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monte Carlo Filtering Using Kernel Embedding of Distributions

Recent advances of kernel methods have yielded a framework for representing probabilities using a reproducing kernel Hilbert space, called kernel embedding of distributions. In this paper, we propose a Monte Carlo filtering algorithm based on kernel embeddings. The proposed method is applied to state-space models where sampling from the transition model is possible, while the observation model ...

متن کامل

Localized Complexities for Transductive Learning

We show two novel concentration inequalities for suprema of empirical processes when sampling without replacement, which both take the variance of the functions into account. While these inequalities may potentially have broad applications in learning theory in general, we exemplify their significance by studying the transductive setting of learning theory. For which we provide the first excess...

متن کامل

Lattice Paths, Sampling without Replacement, and the Kernel Method

In this work we consider weighted lattice paths in the quarter plane N0 × N0. The steps are given by (m, n) → (m − 1, n), (m, n) → (m, n − 1) and are weighted as follows: (m, n)→ (m− 1, n) by m/(m + n) and step (m, n)→ (m, n− 1) by n/(m + n). The considered lattice paths are absorbed at lines y = x/t− s/t with t ∈ N and s ∈ N0. We provide explicit formulæ for the sum of the weights of paths, st...

متن کامل

Nearly Tight Oblivious Subspace Embeddings by Trace Inequalities

We present a new analysis of sparse oblivious subspace embeddings, based on the ”matrix Chernoff” technique. These are probability distributions over (relatively) sparse matrices such that for any d-dimensional subspace of R, the norms of all vectors in the subspace are simultaneously approximately preserved by the embedding with high probability–typically with parameters depending on d but not...

متن کامل

The splitting design that leads to simple random sampling

Implementing unequal probability sampling, without replacement, is very complex and several methods have been suggested for its performance, including : Midseno design and systematic design. One of the methods that have been introduced by Devil and Tille (1998) is the splitting design that leads to simple random sampling .in this paper by completely explaining the design, with an example, we ha...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Probability Inequalities for Kernel Embeddings in Sampling without Replacement

نویسنده

چکیده

منابع مشابه

Monte Carlo Filtering Using Kernel Embedding of Distributions

Localized Complexities for Transductive Learning

Lattice Paths, Sampling without Replacement, and the Kernel Method

Nearly Tight Oblivious Subspace Embeddings by Trace Inequalities

The splitting design that leads to simple random sampling

عنوان ژورنال:

اشتراک گذاری